Acquisition Of Selectional Patterns
نویسندگان
چکیده
1 The Problem For most natural language analysis systems, one of the major hurdles in porting the system to a new domain is the development of an appropriate set of semantic patterns. Such patterns are typically needed to guide syntactic analysis (as selectional constraints) and to control the translation into a predicate-argument representation. As systems are ported to more complex domains, the set of patterns grows and the task of accumulating them manually becomes more formidable. There has therefore been increasing interest in acquiring such patterns automatically froin a sample of text in the domain, through an analysis of word co-occurrence patterns either in raw text (word sequences) or in parsed text. We briefly review some of this work later in the article. We have been specificaily concerned about the prac-ticality of using such techniques in place of manual encoding to develop the selectional patterns for new domains. In the experiments reported here, we have therefore been particularly concerned with the evaluation of our automatically generated patterns, in terms of their complete-hess and accuracy and in terms of their efficacy in performing selection during parsing. In principle, the semantic patterns could be stated in terms of individual words-this verb can meaningfully occur with this subject, etc. In practice, however, this would produce an unmanageable number of patterns for even a small domain. We therefore need to define semantic word classes for the domain and state our patterns in terms of these classes. Ideally, then, a discovery proeednre for semantic patterns would acquire both the word classes and the patterns from an analysis of the word co-occurrence patterns. In order to simplify the task, however, while we are exploring different strategies, we have divided it into separate tasks, that of acquiring word classes and that of acquiring semantic patterns (given a set of word classes). We have previously described [1] some experiments in which the principal word classes for a sublanguge were obtained through the clustering of words based on the contexts in which they occurred, and we expect to renew such experiments using the larger corpora now available. However, the experiments we report below are limited to the acquisition of semantic patterns given a set of manually prepared word classes. The basic mechanism of pattern acquisition is straightforward. A sample of text in a new domain is parsed using a broad-coverage grammar (but without any semantic constraints). The resulting parse …
منابع مشابه
A Neural Network Approach to Selectional Preference Acquisition
This paper investigates the use of neural networks for the acquisition of selectional preferences. Inspired by recent advances of neural network models for nlp applications, we propose a neural network model that learns to discriminate between felicitous and infelicitous arguments for a particular predicate. The model is entirely unsupervised – preferences are learned from unannotated corpus da...
متن کاملWord Sense Disambiguation For Acquisition Of Selectional Preferences
The selectional preferences of verbal predicates are an important component of lexical information useful for a number of NLP tasks including disambigliation of word senses. Approaches to selectional preference acquisition without word sense disambiguation are reported to be prone to errors arising from erroneous word senses. Large scale automatic semantic tagging of texts in sufficient quantit...
متن کاملDiscovery Procedures for Sublanguage Selectional Patterns: Initial Experiments
Selectional constraints specify, for a particular domain, the combinations of semantic classes acceptable in subject-verb-object relationships and other syntactic structures. These constraints are important in blocking incorrect analyses in natural language processing systems. However, these constraints are domain-specific and hence must be developed anew when a system is ported to a new domain...
متن کاملEvaluating and Combining Approaches to Selectional Preference Acquisition
Previous work on the induction of selectional preferences has been mainly carried out for English and has concentrated almost exclusively on verbs and their direct objects. In this paper, we focus on class-based models of selectional preferences for German verbs and take into account not only direct objects, but also subjects and prepositional complements. We evaluate model performance against ...
متن کاملAutomatic Selectional Preference Acquisition for Latin Verbs
We present a system that automatically induces Selectional Preferences (SPs) for Latin verbs from two treebanks by using Latin WordNet. Our method overcomes some of the problems connected with data sparseness and the small size of the input corpora. We also suggest a way to evaluate the acquired SPs on unseen events extracted from other Latin corpora.
متن کامل